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Nucleotide Sequences Coding For Cryptosporidium Proteins/ 
Polypeptides coded by said sequences and kits for the use thereof 

Description 

The invention concerns nucleic acids and 
5 polypeptides coded by said nucleic acids, derived from 
protozoan parasites of Cryptosporidium genus. Nucleic 
acids and peptides are advantageously used for 
developing detection assays of Cryptosporidium in 
biological samples of human and animal origin and/or in 

10 the environment. 

Cryptosporidium parasites infect the intestinal 
tract of several animal species. Over the last decade 
the number of infections in humans has dramatically 
increased. Most of the affected patients show a marked 

15 immunodeficiency, with a high incidence of AIDS. The 
immunocompromised patients develop a severe and 
frequently irreversible diarrhoea which causes 
malnutrition and represents a major factor leading to 
death. 

20 The prophylaxis of Cryptosporidium infection's 

is hampered by the lack of reliable immunoassays for 
the detection of the parasite. Moreover, by microscopic 
examination, parasite oocyst are difficult to 
discriminate from several microoganisms that are 

25 morphologically similar to Cryptosporidium , like 
Candida species. 

The development of immunological and molecular 
diagnostic assays highly specific for Cryptosporidium 
absolutely requires the biochemical characterisation 

30 and localization of parasite antigen* and the cloning 
of corresponding genes. The availability of diagnostic 
assays would permit to develop prophylactic measures 
for immunodeficient patients by detecting the parasites 
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in the environment (food, water) and in individuals 
(relatives, nurses) that may act as carriers. 

Moreover, no effective therapeutic compounds 
against Cryptosporidium infection are available; 
therefore there is the need to develop reagents able to 
prevent the onset of the infection in immunodeficient 
patients . 

The authors of the present invention have 
obtained a rabbit antiserum raised against a whole 
lysate of Cryptosporidium oocysts. The serum has been 
used to screen a genomic library of Cryptosporidium 
from infected intestinal mucosal cells, into the 
expression vector Xgtll. Among clones isolated, clone 
cpRL3 has been shown to comprise a 2359 bp (SEQ ID 
15 No.l) insert with an open reading frame coding for a 
polypeptide of 786 amino acids (SEQ ID No. 2) . Scanning 
of the 67.0 version of the "GENE BANK" data base using 
the cpRL3 sequence failed to reveal any similarity with 
known DNA sequences. 
20 Tn e cpRL3 insert has been subcloned into an 

expression plasmid and the corresponding recombinant 
polypeptide is produced in E. coli, fused at the 
N-terminus to a stretch of six histidines. The 
histidine sequence allows a fast and efficient 
25 purification by nickel chelate chromatography of the 
polypeptide encoded by cpRL3. The recombinant 
polypeptide is used as immunogen in Balb/c mice for 
producing antisera and monoclonal antibodies. The 
recombinant protein is highly immunogenic. This 
30 sequence codes for a portion of a protein of the oocyst 
wall of Cryptosporidium that has an apparent molecular 
weight of 190,000 Dalton, named "Cryptosporidium Oocyst 
Wall Protein" (COWP) . 

Nucleotide and polypeptide sequences derived 
35 from the isolated sequence are advantageously utilized 
for diagnosis of Cryptosporidium infections in patients 
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and/or animals and for analysis of environmental 
contamination by Cryptosporidium oocysts. Treatment of 
'Cryptosporidium infection may be obtained by 
administration of antibodies raised against COWP. 
5 The invention concerns also: - designing of 

specific oligonucleotides, as primers in polymerase 
chain reactions (PCR) reactions for the detection of 
the COWP DNA sequence; - the utilisation of COWP amino 
acid sequence or fragments thereof to raise antisera 

10 and/or monoclonal antibodies for the development of 
immunological assays to detect the presence of 
Cryptosporidium and/or COWP molecule; - the use of 
COWP derived polypeptides, either synthetic or 
recombinant, as components of diagnostic kits for the 

15 detection of COWP released by parasites; - the 
utilisation of COWP-derived polypeptides, either 
synthetic or recombinant, for producing antisera and/or 
monoclonal antibodies to be employed in the therapy of 
Cryptosporidiosis • 

20 It is a specific object of the present 

invention a polypeptide in a substantially purified 
form comprising a contiguous sequence coded by a 
Cryptosporidium gene, said gene comprising a nucleotide 
sequence at least 50 % homologous to the sequence of 

25 SEQ ID No.l. Preferably said contiguous sequence 
comprises an antigenic determinant of Cryptosporidium. 
More preferably said contiguous sequence is coded by 
the sequence of SEQ ID No.l, or parts thereof; most 
preferably said contiguous sequence is comprised in the 

30 aminoacid sequence of SEQ ID No. 2. 

It. is another object of the invention a 
diagnostic kit for the detection of Cryptosporidium in 
biological and environmental samples comprising, as 
specif ligand, the polypeptide according to the 

35 invention. 
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It is another object of the invention the use 
of a polypeptide according to the invention for raising 
'antibodies able to detect Cryptosporidium infection in 
biological and environmental samples. 

It is another object of the invention an 
antibody obtained using as immunogen a polypeptide 
according to the invention. 

It is another object of the invention a 
diagnostic kit for the detection of Cryptosporidium in 
biological and environmental samples comprising, as 
specif ligand, an antibody able to react with at least 
one polypeptide according to the invention. 

It is another object of the invention an 
oligonucleotide derived from a Cryptosporidium gene, 
15 said gene comprising a sequence at least 50 % 
homologous to the sequence of SEQ ID No.l. Preferably 
said oligonucleotide has a sequence comprised in the 
sequence of SEQ ID No.l, or in the complementary 
sequence of SEQ ID No.l. 
20 It is another object of the invention a 

diagnostic kit for the detection of Cryptosporidium in 
biological and environmental samples comprising, as 
specif ligand, the oligonucleotide according to the 
invention. 

25 It is another object of the invention a PCR kit 

for the amplification of Cryptosporidium DNA 
comprising, as specif primer, at least one 
oligonucleotide according to the invention. Preferably 
said PCR kit comprises two oligonucleotides having 

30 nucleotide sequences according to the invention. 

The invention will be illustrated in the 
following examples, by making reference to the 
following figures, wherein: 

- Figure 1 shows an immunoblot analysis of the 

35 mouse serum (M10/01) (A) and of a control mouse serum 
anti TRAP (B) against: the expression product of the 
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control plasmid pDS56/RBSII-E-6his-TRAP purified on 
nickel column (1); the expression product of the 
'plasmid pDS56/RBSII-E-6his-c P RL3 purified on nickel 
column (2) ; a protein lysate of E. coli cells 
transformed with pDS56/RBSII-E-6his-cpRL3 induced with 
IPTG (3); and non induced (4); molecular weight 
standard are indicated. 

- Figure 2 shows an immunoblot analysis using 
the mouse serum M10/01 (A) and the MAbl IB2 (B) against 
a protein lysate of C. parvum oocysts. 

- Figure 3A shows an electrophoresis of PCR 
products amplified from C. parvum DNA (1; 4) and from 
DNA of the plasmid pDS56/RBSII-E-6his-cpRL3 (2; 5) using 
primers combinations Cry-3/Cry-6 (1;2) or Cry-5/Cry-6 
(3; 4; 5;). As control, PCR reaction without template DNA 
(3) . Figure 3B shows an electrophoresis of PCR products 
amplified from DNA of several parasite species using 
primer combination Cry-3/Cry-6: DNA extracted from: C. 
parran (1), Sarcocystis sp. (2), Giardia lamblia (3) 
and P. falciparum (4) . Figure 3C shows an 
electrophoresis of PCR products amplified from DNA of 
progressively diluted C. parvum oocysts using the 
primer combinations Cry-3/Cry-6: molecular standards 
(1), 160 oocysts (2), 80 oocysts (3), 40 oocysts (4), 
20 oocysts (5) 10 oocysts (6). As control, PCR reaction 
is done without template DNA (7) or in the presence of 
P. falciparum DNA (8) . In panels A, B and C, lanes 6, 5 
and 1 respectively, DNA markers are 3611; 1166; 606; 
517; 396; 318; 263 bp. 

Example 1 Xgtll library with DNA extracted from C. 
parvum infected calf intestinal mucosa 

To develop a C. parvum genomic expression 
library, DNA extracted from the intestinal mucosa of an 
infected calf is used. 

A newborne calf is infected with 6 x 10 8 
oocysts of C. parvum MI ISS-1 (Pozio et al. 1992. 
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Trans. R. Soc. Trop. Med. Hyg. 86: 636-638) . After 5 
days, the gut is opened, cut into segments of 30 cm 
each and washed in phosphate-buffered saline (PBS) 
Nitro-cellulose filters (soaked in PBS) of the same 
size as the gut segments are applied to the mucosal 
side for a few seconds. Filters are progressively 
numbered and processed for DNA extraction. A small 
sample is removed from each filter and analysed by 
microscopy to determine whether parasites have been 
removed. The sample from each filter is incubated with 
1% glutaraldehyde in cacodylate buffer for 2 h. The 
filters are dehydrated through an increasing ethanol 
series, embedded in Epon, and cured at 60 °c for 24 h. 
Sections are cut at 0.2-^m thickness and stained with 
toluidine blue. The analysis reveals that nitro- 
cellulose filters remove only the superficial layer of 
mucosal cells, together with a large number of 
parasites . 

Only DNA extracted from filters that have 
removed a large number of parasites are used. DNA is 
digested with EcoRI and cloned in Xgtll EcoRI digested, 
3' end to the coding sequence of 0-galactosidase gene. 
Phage DNA with cloned inserts is packaged in vitro 
(Boehringer Mannheim in vitro packaging kit) to 
generate the library. The quality of the library is 
evaluated by analysing the sizes of a subset of inserts 
by polymerase chain reaction (P C R) with 
oligonucleotides corresponding to the flanking 
sequences of the EcoRI site of the 0-galactosidase 
gene. The library has a complexity of 4.5 X 10* plaques 
and an estimated average insert size of 1,800 bp. 

The expression library is analysed by use of 
a rabbit serum developed against purified oocysts of C 
parvum MI ISS-1. The serum is used after removal of the 
background reactivity by several absorptions on filters 
soaked with bacterial and phage lysates. Specific 
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antibodies bound to filters are detected by use of a 
second anti-rabbit antibody conjugated to alkaline 
phosphatase. An insert, named cpRL3, consisting of a 
2,359-bp open reading frame encoding a polypeptide of 
5 786 amino acids, is isolated. The lack of both a start 
codon and a stop codon indicates that the sequence 
represents part of the coding sequence of the isolated 
parasitic gene. 

Example 2 Expression o f the cpRL3 sequence in e. „h 
10 The DNA ins e*t cpRL3 is cloned in the EcoRI 

site of plasmid pDS56/RBSII-E~ 6xHis (a pDS56/RBSII 
derived plasmid containing an EcoRI site in the 
polylinker). The expression unit of this vector is 
under the control of an isopropyl-0- 
15 thiogalactopyranoside (IPTG) -inducible promoter and 
yields a fusion between a stretch of 6 histidines and 
the amino terminus of the inserted sequences (Stuber et 
al. 1990, Eur. J. Immunol 20:819-824). C pRL3 is 
expressed in E. coli M15 carrying the lac 
20 repressor-producing plasmid pUHAl. Induction is 
performed in LB medium for 4 h at 37 °C; 1 mM IPTG is 
added when the cell density reaches an optical density 
at 600 nm of 0 6. 

Example 3 Purification of recombinant voLyp eptirt* ^ 
25 His-cpRL3 ' ^ — 

The expression product of the cpRL3 sequence 
(recombinant polypeptide 6xHis-cpRL3) is purified in a 
single-step procedure, by nickel chelate affinity 
chromatography (Stuber et al., ibid.). In brief, one 

30 litre of an induced culture of M15(pUHAl) cells 
carrying plasmid pDS56/RBSII-E-6xHis-cpRL3 is harvested 
and stirred for 3 h in 100 ml of 6 m guanidine 
hydrochloride, 100 mM Na 2 HP0 4 , P H 8. The suspension is 
centrifuged at 10,000 x g, and the supernatant is 

35 directly applied to a nickel column (NTA-resin, 
Diagen) . After an equilibration step with 8 M urea, 100 
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»M NaH 2 P0 4 , 10 « iris, pH 6, 6xHis-cpRL3 is eluted by 
lower lng the P H of the urea solution stepwise to P H 4 

IIZZ Utre ^ CUltUIe ' 1 - " is 
5 Example 4 Immunoblottincr 

Parasite lysates are obtained from 
Cryptosporidium oocysts pur . f . ed fay ? 

cen tef ugation (Waldman, E., et al. 1986. / clin 

Microbiol. 23:199-200). Parasites are i ysed by 

10 incubation of pellet of 2 x W 7 oocysts with 0.2 ml of 

sample buffer (33 mM Tris-HCl, P H 6.8, 190 mM glycerol 

0 1» «, Bacterial lysates are obtained by ITZlL 
Of 109 lnduced Qr non . nduced . ceiis ^ 

of sample buffer. Proteins in total cell lysates are 

5 ZlTn b V SDS - p ° lyaCrylamide * el electrophoresis 
(PAGE) (Laemmli,U.K. 1970. Nature (London) 227:680-685) 
and electroblotted onto nitro-cellulose filters 
(blotting buffer, 25 mM Tris, 192 mM glycine, 20% 
methanol) . Non-specific adsorption of antibodies to the 
nitro-cellulose is prevented by saturation of the 
filters with 1% bovine serum albumin in 2x TEST (20 mM 
Tris-HCl, pH 8, 300 mM NaCl, 0.1% Tween 20) for 2 h at' 
room temperature. Nitrocellulose filters are incubated 
with antibodies for 2 h at room temperature. After 
extensive washing with 2x TBST, antibodies bound to the 
filters are detected by use of goat anti-mouse 
immunoglobulin (heavy and light chains) conjugated to 
alkaline phosphates (Promega) . Phosphatase activity is 
disclosed by incubation of the filters with 0.3 mg of 
30 Nitro Blue Tetrazolium and 0.15 ffig of 5 _ bromQ 
4-chloro-3-indolyl phosphate per ml in 100 mM Tris-HCl 
(pH 9.5) -100 mM NaCl-5 mM MgCl 2 . 
Example 5 Monoclonal antibody p rorinrH.n 

„ h Aft6r pUrif ication b y nickel chelate 

chromatography, recombinant polypeptide 6xHis-cpRL3 is 
used as immunogen to develop specific antisera and 
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monoclonal antibodies. BALB/c mice are immunised three 
times with 50 M of purified 6xHis-cpRL3 polypeptide in 
complete (for the first immunisation) or incomplete 
Freund's adjuvant. Five days after the "last 
immunisation, mouse spleen cells are fused with x63 Ag 
8653 myeloma cells and subsequently screened for 
antibody production (Kohler,G. and C. Milstein 1975 
Nature (London) 256:495-497). 

The supernatants of cultures from growing 
hybrids are tested in an enzyme-linked immunosorbent 
assay (ELISA) against 6xHisCpRL3. Immunised mice 
develop an antibody titer of 1/500,000. Figure 1 (A and 
B) shows the ability of one of the antisera to 
recognise specifically by immunoblot the recombinant 
polypeptide encoded by cpRL3. The antiserum recognises 
the recombinant polypeptide in a total lysate of 
bacteria in which the expression of the cpRL3 insert is 
induced with IPTG (Fig. 1A, lane 3) . The antiserum also 
recognize the polypeptide after purification by the 
nickel chelate chromatography. The specificity of the 
reaction is demonstrated by the lack of reactivity 
against E. coli proteins, as well as against an 
unrelated recombinant protein, TRAP, expressed from the 
control plasmid pDS56/RBSII-6xHis (TRAP) , also sharing 
25 the six histidine amino- terminal tail. Antibodies 
developed against the expression product of insert 
cpRL3 (mouse antiserum and monoclonal antibody 2B11) 
detect, in the lysate of Cryptosporidium oocysts, a 
protein of the apparent molecular weight of 190,000 
Dalton, (Figure 2 A and B) . These results indicates 
that the expression product of the insert cpRL3 is part 
of a Clyptosporidium protein expressed in the parasite 
oocysts. 

Example 6 Immunofluorescence microscopy 

Purified parasite oocysts are air dried on a 
covers lip and fixed in cold acetone for 5 min. Non- 
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specific .binding is prevented by P re-incubation of the 
samples in PBS containing 1% bovine albumin. Primarv 
antibodies (culture supernatant) are allowed to react 
for 40 min at room temperature, and the secondary 
fluorescinated antibody (Becton Dickinson goat 
anti-mouse) is allowed to react for 20 min. Observation 
of the samples is carried out with confocal microscopic 
apparatus (Bio-Rad Laboratories) . 

The protein is localised by immunofluorescence 
using the monoclonal antibody 2B11 on a preparation of 
oocysts fixed with acetone. The monoclonal 2B11 
specifically binds to a protein of oocyst wall of 
Cryptosporidium. The result of the immunolocalization 
is confirmed by confocal immunofluorescence analysis 
which shows that the reactivity is detected on the 
surface of the oocyst wall. On the basis of these 
observations, the Cryptosporidium protein encoded by 
the insert cpRL3 is named as Cryptosporidium oocyst 
wall protein, COWP. 

By immunofluorescence, monoclonal antibodies 
can identify as few as 100 oocysts/ml of stools. The 
detection limit of this method is determined 
resuspending known numbers of oocysts in samples of non 
infected stools. 
25 Example 7 ELISA assays 

Monoclonal antibodies produced against the 
product of the insert c P RL3 are able to recognise COWP 
xn solution when used in ELISA assays based on antigen 
capture with two antibodies. Therefore specific 
antibodies directed against this protein or fragments 
thereof having an antigenic activity, as the product 
of cpRL3, can be employed to develop diaanostic assav 
to reveal Cryptosporidium infection. 
Example 8 PCR 

For all PCR experiments, the cpRL3 sequence is 
amplified in a standard 50 „1 PCR reaction mixture 
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S Sa j ki : t R ^*',l a1 ' 1988 ' SCiSnCe 239s «"-«l; Scharf, 
S- J- et al. 1986. Science 233:1076-1078) for 35 cvclel 

LabT^ 58 ° C S6C); 72 ° C(1 ■ 

Lab Line thermal cycler. The final concentration of 

MgCl 2 is 2 am. Primers used are: 

Cry3 ( 5 ' GTCCTACTGGATTCACTCTAC-3 ' ) 

coding strand, nt. 722-742 of SEQ ID No. 1; 

Cry5 { 5 » -CCAGGACATCATCATGGTCATTCTCATGGGC-3 » ) 

coding strand, nt. 1099-1129 of SEQ ID No. i, 

Cry 6 (5»-CCGAATATGTAACACATTTATCCGC-3») 

7LT a TlnV nd ' 1828 " 1852 ° f ^P^entary 

strand of SEQ ID No. 1. 

Crvn , ^^ amplification of the cpRL3 sequence from 
Cryptosporidium oocysts, samples are incubated for 5 
nun under reducing conditions and boiled for 10 min 
thereafter. TaqI polymerase is purchased from 
Perkm-Elmer Co. 

The ability of two oligonucleotide combinations 
to amplify by PGR the sequence of cpRL3 is shown in 

CrZl , 3A * B ° th ° f 0li ™ e °tide combinations 
Cry3/cry6 and Cry5/Cry6 are able to amplify DNA 
segments of the expected molecular weight from the 
plasmid pDS56/RBSII-E-6xHis ( C pRL3) and from DNA of 
Cryptosporidium. The amplification reaction using 
oligonucleotides Cry3/Cry6 is highly specific, (figure 
3B, . Oligonucleotides in fact do not amplify any DNA 
segment when DNA of other protozoa (Giardia lamblia, 
Plasmodium falciparum o Sarcocystis suis ho minis) is 
used as template. m pgr experiments the 
oligonucleotide combination Cry3/Cry6 is able to 
amplify a specific DNA segment from as few as 40 
oocysts of Cryptosporidium, (Figure 3C) . These data 
indicate that by using combination of oligonucleotides 
corresponding different regions of C pRL3 sequence, PGR 
can be employed to detect the presence of 
Cryptosporidium. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Universita' degli Studi di Roma "La 
Sapienza" 

(B) STREET: P.le Aldo Moro 5 

(C) CITY: Rome 

(E) COUNTRY: Italy 

(F) POSTAL CODE (ZIP) : 00185 

(A) NAME: Istituto Superiore di Sanita' 

(B) STREET: V.le Regina Elena 299 

(C) CITY: Rome 

(E) COUNTRY: Italy 

(F) POSTAL CODE (ZIP) : 00161 

(ii) TITLE OF INVENTION: Nucleotide sequences coding for 
Cryptosporidium proteins, polypeptides coded by said 
sequences and kits 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2359 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAA TTC GAA TGC CCA CCA GGT ACA ATT TTA AAA GAT GAT CAA TGT CAA 
Glu Phe Glu cys Pro Pro Gly Thr lie Leu Lys Asp Asp Gin Cys Gin 
1 5 1° 15 

TCG ATA GAA AGA GTT GAT ACA ATT TGT CCA CCA GGG TTT GTA GAT AAT 96 
Ser lie Glu Arg Val Asp Thr lie Cys Pro Pro Gly Phe Val Asp Asn 
20 25 30 

GGC GAA GAT TGT GTC CAA TTT TCT GCA CCA GAG AAA ATT TGC CCC CAA 144 
Gly Glu Asp Cys Val Gin Phe Ser Ala Pro Glu Lys lie Cys Pro Gin 
35 40 45 

GGA TTT TCT CTT TCC GGA AAA CAA TGT GTT AAA ACA GAA TCT GCT CCA 192 
S Phe Ser Leu Ser Gly Lys Gin Cys Val Lys Thr Glu Ser Ala Pro 
50 55 60 

AGA TTA ACA GAA TGC CCA CCA GGT ACA ACC TTG GAA AAT AAC AGT TGT 240 
Arg Leu Thr Glu Cys Pro Pro Gly Thr Thr Leu Glu Asn Asn Ser Cys 
65 70 



ATT TCA TAT GAA CTA GAA GAT GCC ATT TGT CCA CCT GGA TAT CTC GAC 
lie Ser Tyr Glu Leu Glu Asp Ala lie Cys Pro Pro Gly Tyr Leu Asp 

85 90 

AAT GGA TCA GAC TGT GTT CAG TTT TCT CAA CCA GAA AAG GAG TGT CCA 
A^n gS 5S-2p Cys Val Gin Phe Ser Gin Pro Glu Lys Glu Cys Pro 
100 105 

ara CCT TTT GTA TTA ATT GGA AAA CAA TGT ACC CAA ACT ACT CAA GCT . 
tS Ty Phe vll lln lie Gly Lys Gin Cys Thr Gin Thr Thr Gin Ala 
115 120 125 

CCA CCA CAA CCA 'GAG TGT CCT CCA GGT ACA AAC CTG GTA AAT GGA CAA 
Pro Pro S£ Pro Glu Cys Pro Pro Gly Thr Asn Leu Val Asn Gly Gin 
130 135 140 

TGC CAA AAA GTT GAA AGG ATA AAT ATG GTA TGT CCA ACT GGT TTT ATT 
Cys Sn" Lys Val Glu Arg He Asn Met Val Cys Pro Thr Gly Phe lie 
145 I 50 

r*T AAT GGT ACA AAT TGT GCT TCT TTC TCC GCA CCA AAC AGA GAA TGC 
Asp £n tly tS £a Cys Ala Ser Phe Ser Ala Pro Asn Arg Glu Cys 

165 l'° 

rrA m GGA TAT ACA CTT TCT GGA TCC CAA TGC GAG CAA ATA AAA GAA 
pS Pro gS ?fr tS Leu Ser Gly Ser Gin Cys Glu Gin lie Lys Glu 
180 I 85 



288 



336 



384 



432 



480 



528 



576 
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GCA CCT CCT GTT TCA GAA TGT CCA CCA GGA TAT AAA CTT CAA GGA AAT 
Ala ?rl Pro Val Ser Glu Cys Pro Pro Gly Tyr Lys Leu Gin Gly Asn 



CAA TGT ACT GCA CTA AAA ATG ATC GAT GCT ATC TGC CCA GAT GGA TTT 
Gin Cys Thr Ala Leu Lys Met lie Asp Ala lie Cys Pro Asp Gly Phe 
210 215 " u 

s £ £ ss £ s £ s £ £ i £ s £ £ £ 

ion ZOO 



225 230 



TGT CCT ACT GGA TTC ACT CTA CAA AAT CAA CAG TGT GTT CAA ACA ACT 
Cys Pro Thr Gly Phe Thr Leu Gin Asn Gin Gin Cys Val Gin Thr Thr 

245 " 

£ £ £ £ £ £ £ £ S £ S £ £ g S 5 

260 265 



£ £ £ £ £ S SS S £ E £ £ £ £ £ £ 

2 "7 5 



624 



672 



720 



768 



816 



864 



912 



GTT GGT ACT AGA GAG GGG GAC GTT TGC GTA GAG AGA TCG ATT AGT TCG 
val Gly Thr 5£ gYu Gly Asp Val Cys Val Glu Arg Ser lie Ser Ser 
290 295 300 

£ £ £ £ £ £ £ £ - £ S - ~ S £ i 960 

305 310 31S 

£ £ £ £ £ £ £ E £ S £ £ £ 3 1 £ 1008 
£ £ £ £ £ £ £ £ S S £ £ £ S S £ 1056 



£ £ £ £ £ £ .£ S £ S £ £ £ £ £ £ 1 10 4 
£ £ £ S ££ £ S S S - g - " £ £ 1152 



370 375 



£ £ £ £ £ £ £ £ £ £ £ £ £ S £ £ 1200 

385 390 395 
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CCA ATT TGT GAA GAT GTT CCA AAA ATT ACC CCA AAA ACT TGT ACA AAA 1248 
Pro He Cys Glu Asp. Val Pro Lys He Thr Pro Lys Thr Cys Thr Lys 

405 410 415 

GCT GAT TCT GTC CCA GCT GTG CCT ATT TGC GAG AAC AAT GCT GAA CTT 1296 
Ala Asp Ser Val Pro Ala Val Pro He Cys Glu Asn Asn Ala Glu Leu 
420 425 430 

GTA GGA AAA GAA TGT GTA TTA ACA AAT TAC TAC CCA TTA GAA GCA ATT 1344 
Val Gly Lys Glu Cys Val Leu Thr Asn Tyr Tyr Pro Leu Glu Ala He 
435 440 445 

TGT CAA GAT GGA ACA AGA TCA AAA GAG TGT GCT AAG TTT GTA AAA ACT 1392 
Cys Gin Asp Gly Thr Arg Ser Lys Glu Cys Ala Lys Phe Val Lys Thr 
450 455 460 

CCA CCT ACT TTA AAA TGT CCG CCA GGT TCT GTA GAT GTA GGA TCT CAA 1440 
Pro Pro Thr Leu Lys Cys Pro Pro Gly Ser Val Asp Val Gly Ser Gin 
465 470 475 480 

TGT CAA GTT AAC AAA TAT TCA CCA TAT GAT CTT GCA TGC CCT GCA GGA 1488 
Cys Gin Val Asn Lys Tyr Ser Pro Tyr Asp Leu Ala Cys Pro Ala Gly 

485 490 495 

TAT GCA TTG GTT GGA GAC AAA TGC GCT ACC ACA AGA GAA AAA GTT TGC 1536 
Tyr Ala Leu Val Gly Asp Lys Cys Ala Thr Thr Arg Glu Lys Val Cys 
500 505 510 

CCG AAT GAA AGT TGC CAA AGA GTT GTA ACT GCG CCT GTT TCT TTA ACT 1584 
Pro Asn Glu Ser Cys Gin Arg Val Val Thr Ala Pro Val Ser Leu Thr 
515 520 525 

TGT CCC CCT GGA TAT CAC CAA ATA GAT GAA GTT ATG AAT ATT TCT GCT 1*632 
Cys Pro Pro .Gly Tyr His Gin He Asp Glu Val Met Asn He Ser Ala 
530 535 540 

CAT CCA CAC CAC AGA CAC TTA GCT GGG GTT CAA TCT ACT TCT CAA AAG 1680 
His Pro His His Arg His Leu Ala Gly Val Gin Ser Thr Ser Gin Lys 
545 550 555 560 

GGA TAT TCT CAT GGA CAT AAA TAT ACT CCT GTA ATT TCT CAG CCA CCA 1728 
Gly Tyr Ser His Gly His Lys Tyr Thr Pro Val He Ser Gin Pro Pro 

565 570 575 

CAA CCA GTT CCA GTT GTT GCT CCT ATT CAG CAA ATG AAA TGC ATC CAT 1776 
Gin Pro Val Pro Val Val Ala Pro He Gin Gin Met Lys Cys He His 
580 585 590 

GCA AAC CAT GCT CCA TAT AAT CTT ATC TGT CCT GTT GGA TCA AGA CTT 1824 
Ala Asn His Ala Pro Tyr Asn Leu He Cys Pro Val Gly Ser Arg Leu 
595 600 605 
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GTA GCG GAT AAA TGT GTT ACA TAT TCG GAT AAA ATA TGT CCA AAT GGT 1872 
Val Ala Asp Lys Cys Val Thr Tyr Ser Asp Lys He Cys Pro Asn Gly 
610 615 620 

AAT TGC GAG CGT ATA TAT AAT GAG CCT GCT GAA TTA GTA TGC CCT CCA 1920 
Asn Cys Glu Arg He Tyr Asn Glu Pro Ala Glu Leu Val Cys Pro Pro 
625 630 635 640 

GGA TTC TCA TCA TCT AAA CCA ATT CAG CCA ATA AGC CAT TCT CAT ATT 1968 
Glv Phe Ser Ser Ser Lys Pro He Gin Pro He Ser His Ser His He 

645 650 655 

AAC CAT CCA AAT GTT TCT GTT CCC GTC CAA CCA CAA ACT ATT AAC CAA 2016 
Asn His Pro Asn Val Ser Val Pro Val Gin Pro Gin Thr He Asn Gin 
660 665 670 

CCA CAA GTA ATT CAA CAA AGA CAA GTA AAT TAT CAG CCA CAA GTA ATT 2064 
Pro Gin Val He Gin Gin Arg Gin Val Asn Tyr Gin Pro Gin Val He 
675 680 685 

CAT CAA ACA CAG GAA ATT TTA ACA ACT TAT CCA ACT CCA GTT TAC CAA 2112 
His Gin Thr Gin Glu He Leu Thr Thr Tyr Pro Thr Pro Val Tyr Gin 
690 695 700 

ACC GGC ACA ATT TAT CAA GGA CAT CAT CAT CAT CAT CAT CAT CAT CAC 2160 
Thr Gly Thr He Tyr Gin Gly His His His His His His His His His 
705 710 715 720 

AGA AAT CTA GCT TCC CCT GAG TGC ATT AAG ACA ATT TCA GTA CCT TAT 2208 
Arg Asn Leu Ala Ser Pro Glu Cys He Lys Thr He Ser Val Pro Tyr 

725 730 735 

ATT TTA AAA TGC GAA TCT CCA TTT ATT TTA GAT GGC GAC AAA TGT ATC 2256 
He Leu Lys Cys Glu Ser Pro Phe He Leu Asp Gly Asp Lys Cys He 
740 745 750 

GAA AAA ACA GAA AAA ATT TGT CTA CAA GGT GAC TGC AGA AAA CAA GTC 2304 
Glu Lys Thr Glu Lys He Cys Leu Gin Gly Asp Cys Arg Lys Gin Val 
755 760 765 

GTC GTT CCA CCA ACT CTT TCA TGT CCA CAA GGT TAC AGA AAT GCC AAC 2352 
Val Val Pro Pro Thr Leu Ser Cys Pro Gin Gly Tyr Arg Asn Ala Asn 
770 775 780 

2359 

GGA ATT C 
Gly He 
785 



WO 94/04681 



PCT/IT93/00089 



17 



(2) INFORMATION FOR SEQ ID NO: 2: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Phe Glu Cys Pro Pro Gly Thr He Leu Lys Asp Asp Gin Cys Gin 
15 10 15 

Ser He Glu Arg Val Asp Thr He Cys Pro Pro Gly Phe Val Asp Asn 
20 25 30 

Gly Glu Asp Cys Val Gin Phe Ser Ala Pro Glu Lys He Cys Pro Gin 
35 40 45 

Gly Phe Ser Leu Ser Gly Lys Gin Cys Val Lys Thr Glu Ser Ala Pro 
50 55 60 

Arg Leu Thr Glu Cys Pro Pro Gly Thr Thr Leu Glu Asn Asn Ser Cys 
65 70 75 80 

He Ser Tyr Glu Leu Glu Asp Ala He Cys Pro Pro Gly Tvr Leu Asp 

85 90 * 95 

Asn Gly Ser Asp Cys Val Gin Phe Ser Gin Pro Glu Lys Glu Cys Pro 
100 105 110 

Thr Gly Phe Val Leu He Gly Lys Gin Cys Thr Gin Thr Thr Gin Ala 
115 120 125 

Pro Pro Gin Pro Glu Cys Pro Pro Gly Thr Asn Leu Val Asn Gly Gin 
130 135 140 

Cys Gin Lys Val Glu Arg He Asn Met Val Cys Pro Thr Gly Phe He 
145 150 155 160 

Asp Asn Gly Thr Asn Cys Ala Ser Phe Ser Ala Pro Asn Arg Glu Cys 

165 170 175 

Pro Pro Gly Tyr Thr Leu Ser Gly Ser Gin Cys Glu Gin He Lys Glu 
180 185 190 

Ala Pro Pro Val Ser Glu Cys Pro Pro Gly Tyr Lys Leu Gin Gly Asn 
195 200 205 
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Gin Cys Thr Ala Leu L ys Ile ^ Ua IU ffl prQ ^ ^ ^ 



220 



Leu Pro Asn Gly Asp Asp Cys Ile Gin Phe Ser 



230 



235 



Pro Ala Ser Thr Val 



240 



Cys Pro Thr Gly P he Iht Leu Gln Asn Gln ^ Cys ^ ^ ^ ^ 

250 255 
Thr ser Pro Lys Thr Pro Glu Cys Pro Pro Gly Ser Ala Leu Asp Gly 



270 



Asp. ser Cys Thr Arg Leu Val Pro Gly Ala Leu Gln Tyr Val 



280 



285 



Cys Pro 



Val Gly Thr Arg Glu Gly Asp Val Cys Val Glu Arg Ser lie Ser Ser 

295 300 

Pro Val Leu Glu Cys Pro Pro Gly Tyr Ser Leu Glu Thr Gly Lys Gln 

315 320 

Cys Val Arg Arg Ser Gln Tyr Asp Cys Ser Val Thr Thr Tyr Val Thr 

330 335 

Glu cys Lys Thr Pro Asp Val Lys Ala Leu Arg Ar, Leu Ala Ala Ala 

345 350 
I-ys Glu Thr ser Thr Val Tyr Glu Thr Ser Glu He Gln Asn Pro Gly 



365 



His His His Gly His Ser His Gly His Ser His Ser Gln 

375 380 
lie Gln Thr Gln Asn lie His Thr Gln His Hi 



Val lie Pro 



390 



395 



s Lys Glu Ala Pro Arg 



400 



Pro lie cys Glu Asp Val Pro Lys He Thr Pro Lys Thr Cys Thr Lys 
Ala Asp ser Val Pro Ala Val Pro lie Cys Glu Asn Asn Ala Glu Leu 



425 



Val Gly Lys Glu Cys Val Leu Thr Asn Tyr Tyr Pro Leu 



440 



445 



430 



Glu Ala lie 



Cys Gln Asp Gly Thr Arg Ser Lys Glu Cys Ala Lys Phe Val Lys Thr 



460 



Pro Pro Thr Leu Lys Cys Pro Pro Gly Ser Val Asp Val Gly 

470 475 



Ser Gln 
480 
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Cys Gin Val Asn Lys Tyr Ser Pro Tyr Asp Leu Ala Cys Pro Ala Glv 

485 A an - - wx .y 



490 



495 



Tyr Ala Leu Val Gly Asp Lys Cys Ala Thr Thr Arg Glu Lys Val Cys 
500 ertc * 



505 



510 



Pro Asn Glu Ser Cys Gin Arg Val Val Thr Ala Pro Val Ser Leu Thr 
515 520 



525 



530 HiS 51? 116 ASP G1U Val M6t Asn Ile Ser Ala 



535 



540 



His Pro His His Arg His Leu Ala Gly Val Gin Ser Thr Ser Gin Lys 



550 



555 



560 



Gly Tyr Ser His Gly His Lys Tyr Thr Pro Val He Ser Gin Pro P 



565 



570 



ro 



575 



Gin Pro Val Pro Val Val Ala Pro Ile Gin Gin Met Lys Cys Ile His 
580 585 590 

Ala Asn His Ala Pro Tyr Asn Leu Ile Cys Pro Val Giy Ser Arg Leu 

595 600 605 

Val Ala Asp Lys Cys Val Thr Tyr Ser Asp Lys Ile Cys Pro Asn Gly 
610 615 620 

Asn Cys Glu Arg He Tyr Asn Glu Pro Ala Glu Leu Val Cys Pro Pro 



630 



635 



640 



Gly Phe Ser Ser Ser Lys Pro Ile Gin Pro Ile Ser His Ser His Ile 



645 



650 



655 



Asn His Pro Asn Val Ser Val Pro Val Gin Pro Gin Thr Ile Asn Gin 
660 665 670 

Pro Gin Val Ile Gin Gin Arg Gin Val Asn Tyr Gin Pro Gin Val He 
675 680 685 

His Gin Thr Gin Glu Ile Leu Thr Thr Tyr Pro Thr Pro Val Tyr Gin 
g 9° 695 700 

Thr Gly Thr He Tyr Gin Gly His His His His His His His His His 
705 710 715 . 720 

Arg Asn Leu Ala Ser Pro Glu Cys Ile Lys Thr Ile Ser Val Pro Tvr 

725 730 735 

He Leu Lys Cys Glu Ser Pro Phe Ile Leu Asp Gly Asp Lys Cys Ile 
740 745 750 
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Glu Lys Thr Glu Lys lie Cys Leu Gin Gly Asp Cys Arg Lys Gin Val 



760 



765 



Val Val Pro Pro Thr Leu Ser Cys Pro Gin Gly Tyr Arg Asn AlaAsn 



775 



780 



Gly He 
785 
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Claims 



1- A polypeptide in a substantially purified form 
comprising a contiguous sequence coded by a 
Cryptosporidium gene, said gene comprising a nucleotide 
5 sequence at least 50 % homologous to the sequence of 
SEQ ID No.l. 

2. A polypeptide according to Claim 1 wherein said 
contiguous sequence comprises an antigenic determinant 
of Cryptosporidium. 
10 3. A polypeptide according to any of previous claims 
wherein said contiguous sequence is coded by the 
sequence of SEQ ID No.l, or parts thereof. 

4. A polypeptide according to Claim 3 wherein said 
contiguous sequence is comprised in the aminoacid 

15 sequence of SEQ ID No. 2. 

5. A diagnostic kit for the detection of 
Cryptosporidium in biological and environmental samples 
comprising, as specif ligand, the polypeptide according 
to the invention. 

20 6. use of a polypeptide according to the invention 
for raising antibodies able to detect Cryptosporidium 
infection in biological and environmental samples. 
7. An antibody obtained using as immunogen a 
polypeptide according to any of Claims from 1 to 4. 

25 8. A diagnostic kit for the detection of 
Cryptosporidium in biological and environmental samples 
comprising, as specif ligand, an antibody able to react 
with at least one polypeptide according to any of 
Claims from 1 to 4. 

9. An oligonucleotide derived from a Cryptosporidium 
gene, said gene comprising a sequence at least 50 % 
homologous to the sequence of SEQ ID No.l. 

10. An oligonucleotide according to Claim 9 having a 
sequence comprised in the sequence of SEQ ID No.l, or 
in the complementary strand of SEQ ID No. 1. 



30 



35 
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A m kit according to Claim 12 comprisiaa a s 
1. ; P o?i f 0 PriDer ' «it Cla^ 



